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How to be causal: time, spacetime, and spectra 
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I explain a simple definition of causality in widespread use, and indicate how it links to the 
Kramers Kronig relations. The specification of causality in terms of temporal differential eqations 
then shows us the way to write down dynamical models so that their causal nature in the sense 
used here should be obvious to all. To extend existing treatments of causality that work only in the 
frequency domain, I derive a reformulation of the long-standing Kramers Kronig relations applicable 
not only to just temporal causality, but also to spacetime "light-cone" causality based on signals 
carried by waves. I also apply this causal reasoning to Maxwell's equations, which is an instructive 
example since their casual properties are sometimes debated. 
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I. INTRODUCTION 

Causality is a basic concept in physics - so basic, in 
fact, that it is hard to conceive of a useful model in 
which effects do not have causes. Indeed, the whole point 
of a physical model could be said to describe the pro- 
cess of cause and effect in some particular situation. But 
what do we generally mean by word like "causality" , and 
phrases such as "cause and effect"? Usually, we mean 
that the cause of any event must not be later than any 
of its effects. But even such simple-sounding statements 
are rarely as uncomplicated as they seem: when trying 
to clarify the details and built-in assumptions, it is easily 
possible to get into philosophical discussions , issues 
regarding statistical inference and induction || , or par- 
ticular physical arguments [|J. Here I instead follow the 
physics tradition characterized by Mermin as "shut up 
and calculate" ||. But what should we calculate, and 
howl 

Note that common expressions such as F = ma do 
not express a causal relationship in the sense used here. 
They provide no means of telling whether the force F 
is supposed to cause an acceleration a, or a cause F, or 
even if the equation is instead intended as a constraint of 
some kind. Instead, we start with differential equations 
containing temporal derivatives, which are open-ended 
specifications for the future behaviour. They require only 
a knowledge of initial conditions and the on-going be- 
haviour of the environment to solve. 



Additionally, since any effect R must not occur be- 
fore its cause Q, a mathematical expression for R must 
become non-zero only after that cause. This desired be- 
haviour matches that of the mathematical step function 
h(t), which has h(t) = for t < 0, and h{t) = 1 for t > 0. 
The definition of causality applied by this temporal step 
function is the same as that enforced in the frequency 
domain by the famous Kramers Kronig relations ^ 
Therefore I call the time-step causality discussed here 
"KK causality" , to distinguish it from alternative defini- 
tions. 

After describing the role of differential equations and 
how they can generate the time-step criteria in sectio n p , 
I consider causality in the spectral domain in section [HI , 
and discuss typical models in section IV. Causality and 
Maxwell's Equations is discussed in section |v|, followed 
by the development of a new spacetime formulation of the 
Kramers Kronig relations in section VI. After a discus- 
sion in section VII , I summarize in section VIII . There 
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is also an appendix (|IX|) which considers the results of 
section [n] applied to discretized (or numerical) models. 

This undergraduate level presentation introduces 
causality as a topic in itself, making specific reference 
to the construction of causal models, rather than being 
a brief diversion on the way to discussing the Kramers 
Kronig relations, as is the case in many textbooks. It de- 
tails the connection between time and frequency domain 
representations and then extends this to a full space- 
time "wave-cone" causality. I envisage that this discus- 
sion could be incorporated most easily into relativity or 
electromagnetism courses, although parts could be inte- 
grated into courses on mechanics or wave motion. 



II. CAUSAL DIFFERENTIAL EQUATIONS 

Let us first write down a simple model, where some 
system R responds to its local environment Q. Here R 
can be any quantity - e.g. a position or velocity, a level 
of excitation of some system, and even - if a position r is 
also specified - a probability distribution or wave func- 
tion. Likewise Q might be anything, depending on some 
pre-set behaviour, the behaviour of R or other systems, 
or (e.g.) spatial derivatives of fields, potentials, distribu- 
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FIG. 1: A delta function impulse at a time to is the cause 
Q — S(t — to) of an effect (a step-change in value) on R = 
h(t — to), as a result of the causal model specified in eqn. dfl). 



tions, and so on. Whatever the specific meaning of R (or 
indeed of Q), we start by writing the simplest possible 
differential equation 

d t R(t) = Q(t), (1) 



tives e.g. 

N N-l 

Y T n d\ l R(t) = Y, a ™drQ(t), (2) 

n— m— 

for parameters T n , a m , and a defined maximum order 
of derivatives N, with 0. These will remain KK 

causal as long as the highest derivative on the RHS is of 
lower order than that on the LHS ||. 

We might consider recasting the differential equations 
used here in an integral form; e.g. for an evolution start- 
ing at a time ij, eqn. (Q) becomes 

R(t) = J Q(t')dt', (3) 

although in most cases this is not as easy as writing down 
the differential equation. Also, as discussed next, differ- 
ential equations make it easier to consider the spectral 
properties. And on a more intuitive note, writing down 
a differential equation does not imply you have solved it 
- it is a notation more compatible with the concept of an 
unknown future outcome, dependent on as-yet unknown 
future causes. This point of view becomes clearer when 
discretized forms of the differential models are consid- 
ered, as done in the appendix fllX|). 



where dt is just the time derivative d/dt. To deter- 
mine how causal this model equation is, consider the case 
where the environment contains a simple possible cause: 
a brief delta-function impulse, where Q(t) = QoS(t — to). 
Reassuringly, if I integrate eqn. (Q), then R(t) will gain 
a step at to - i.e. the effect of the impulsive Q is to cause 
R to increase discontinuously by Qo at to', as depicted on 
fig. @. Thus we see how the step function h(t) arises di- 
rectly from the most basic temporal differential equation. 
If we apply ft, as a filter to R, but it changes nothing (in 
the example above, R(t) = R(t)h(t — t )), then we know 
that R(t) is causal, since all effects occur after the cause 
Q = QoS{t-t ). 

I describe this situation, where an impulse gives rise 
to a stepped response as "barely causal" , because part 
of the effect is simultaneous with the cause. Other re- 
sponses (e.g. see section IV) are usually more compli- 



cated and contain higher-order time derivatives, leading 
to a more gradual response. E.g., a differential equation 
with second order time derivatives has a ramp-like (lin- 
ear) response to a delta function cause, whereas third 
order derivatives give a quadratic response. 

Simple examples from kinematics can illustrate the 
meaning of "causal" as used here. If we were to write 
down dtX = v, then we could make the statement that 
"velocity v causes a change in position a;" ; likewise 
dtv = a means that "acceleration a causes a change in 
velocity v" ; and d^x = a means "acceleration a causes 
changes in position x" . 

More general differential equations can be written 
down as weighted sums of different orders of time deriva- 



III. CAUSALITY AND SPECTRA 

Often, the more complicated a model response is, the 
more likely it is that its response will be analyzed in 
the frequency domain. This might be either because 
an experiment has recorded spectral data directly, or 
time varying data S(t) has been converted into a spec- 
trum S(uj) using a Fourier transform 



10 



Although 

it is common to write down the individual sin and co- 
sine Fourier transforms, it is most convenient to combine 
them using e~ lujt = cos(wi) + isin(wi), giving 



S(u) 



1 



V2^ J- 



S{t)e~ lult dt. 



(4) 



Note that even for real-valued S{t), the spectrum S(u) 
can be complex valued. If S(t) is consistent with ca- 
sualty, then its spectrum S(w) must also, and this in- 
sistence that measured spectral data must be consistent 
with causality can be of considerable use O]. So use- 
ful, in fact, that even quite long articles on causality and 
spectra |12[ can get away without any discussion of time- 
domain dynamics at all! 

Let us therefore take our simple eqn. (Q) and either 
Fourier transform it, or take the mathematical shortcut 
of assuming an exp(— iwt) time dependence. Since dtA(t) 
transforms to — iujA(lu), we get 



-iuR(lu) = Q(co) 

R { .)= + m. 



(5) 
(0) 
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The present 

v 




The future 



FIG. 2: A historical record S(t) taken at a time to can only 
contain data prior to that time. 



If Q(t) = QoS(t) is a delta function, then its spectrum is 
a constant with Q{u>) = Qo, so that 



R{uj) = +i— . 

UJ 



(7) 



Since we already know that the solution for R contains 
a step at to, then we now also know (and can check) that 
the Fourier transform of a step function is proportional 
to 1/lo. Returning also to the more general differential 
form in eqn. (J|), we see that since the LHS has higher 
order time derivatives than the RHS, any rearrangement 
to put only R on the LHS (as in eqn. (Q)) will result in 
an RHS that falls off at least as fast as 1/ui. 

An important and useful way of checking and /or en- 
forcing causality on spectra are the Kramers Kronig (KK) 
relations fj], [ll[] . Although derivations are complicated, 
their basic construction is based on two concepts: 



1. The Hilbert transform |10, 13 , is an integral trans- 
form that convolves with the step function h(t). 
This step h(t) establishes or enforces the one sided 
nature of effects that are generated causally: 

(a) A cause at t = to can only have effects R{t) 
that appear for times t > to, so that multiplying 
by h(t — to) has no effect, i.e. R(t) = R(t)h(t — to), 
as on fig. [l| 

(b) A historical record S(t) at a time to only con- 
tains data on the past (t < t ), i.e. S(t) = 
S{t)h(t -t),as on fig. |. 

(c) A linear response function u(t) = u(t)h(t), 
where R(t) = u\t) * Q{t) = J Q °° u(t')Q(t - t')dt' 
only depends on past values of Q(t). The Fourier 
convolution theorum then tells us that R(uj) = 
u(u))Q{uj). 

2. The Fourier transform is based on the exponen- 
tial e~ ltA)t (see eqn. (|j)). It is widely used to re- 
represent a response R{t), time history S(t), or re- 
sponse function u(t) as a spectrum. This requires 
that the Fourier transforms of R, S, or u are well 
behaved enough to exist, which usually requires 
them to be normalizable and to vanish fast enough 



X(t) e {R,u,S} into spectral constraints on the com- 
plex Xkj). Following a well known theorem of Titch- 
marsh |t], Q, we can state that for some causal (i.e. 
step-like) function X(t) which depends only on the past 
(i.e. t < 0), the real and imaginary parts of its frequency 
spectrum are connected by 



X(u) = — 7 



+ 00 



duo' 



(8) 



where the prefactor i serves to cross-link the real and 
imaginary parts of X; often the two parts are written 
explicitly as 



Im 



X(uj) 



X(uj) 



-+oo Im X(oj') 

= -y / — -. -du', 



7T 



UJ — UJ 



u -y 



(9) 



(10) 



The preferred direction for "the past" is set by <j, for a 
response R or u, we set a = +1; for a historical record S 
we use a = — 1. The operator y takes the principal part 
Jl5[ of the integral, returning what we would get for the 
integral if those points at which the integrand diverges 
were skipped. Both cqns. (^) and ( |Io| ) thus inform us 
as to the spectral effect of temporal causality. However, 
it is not necessary to understand the mathematics they 
rely on - i.e. integral transforms and (complex) contour 
integration - in order to appreciate their meaning. 

What the KK relations tell us is that local properties 
are tied to global ones, as noted later on fig. ^. If the 
complex valued X(uj) = represents a response function, 

then the real part (Re X(uj) ) is the dispersion - in op- 
tics, this might be the refractive index change n(uj) — 1, 
whose frequency dependence in glass gives rise to differ- 
ing phase and group velocities Jl6[ for different colours, as 
well as different angles of refraction. The imaginary part 

(Im X(u) ) is then the loss or absorption, or, in ampli- 
fying media (as in a laser), the gain. We can then see that 
the KK relation eqn. ([)]) links the global loss properties 
(Im[A"(w')] for all u>') to the response at a specific fre- 
quency (Re[A > (w)]). Likewise, eqn. (|T(]) links the global 
dispersive properties (Re[X(a/)], for all a/) to losses at a 
specific frequency (Im[A"(u;)]). More generally, any effect 
requiring complicated behaviour for the real part of the 
spectra (usually described as "dispersion" ) usually has 
an associated imaginary component, which for passive 
systems can usually be interpreted as loss. This is the 
origin of commonly made (and not always true) state- 
ments along the lines of "dispersion requires (or induces) 
loss" @ (TJ|. 

IV. A DRIVEN, DAMPED OSCILLATOR 



The Hilbert and Fourier transforms combine to 
turn time-domain "step" restrictions on the real-valued 



Let us now consider a causal response more compli- 
cated than the simple case shown in eqn. (|l|), but both 
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FIG. 3: Typical temporal responses (e.g. either x(t) or 
P(t)) to an impulsive driving force, for a damped oscillator 
in the underdamped (oscilliatory) regime. The initial ramp- 
like response can be seen close to the vertical axis near t — 0. 



simpler and more specific than the very general form of 
cqn. (0). An ideal example is that of a driven, damped 
oscillator such as a mass on a spring Jl9|, ^0|, whose 
temporal differential equation matches that often used 
in electromagnetism _to_ model the Lorentz response in 
a dielectric medium 



21 



. For the mass on a spring, 
with a spring constant k, we have a resonant frequency of 
ujq = yjk/m, and a friction (or "loss") parameter 7; like- 
wise the Lorentz response also has a resonant frequency 
and loss. Given these parameters, the displacement of 
the pendulumn bob x(t) (or dielectric polarization P) 
could then be affected by the driving force per unit mass 
F(t)/m (or electric field E) according to equations of the 
form 



d 2 t x{t) + ~fd t x(t) 
8fP(t) + yd t P(t) - 



h w 2 x(t) 
■<4>P(t) 



F(t)/m, 
ae E(t). 



(11) 
(12) 



Here a delta function impulse in force F(t) = paS(t) 
does not induce an initial step change in position, but 
in velocity dtx(t) ~ Po/ni] with the likewise initial re- 
sponse of a linear (or ramp-like) change in position, with 
x(t) ~ tp /m. In the same way, in an electromagnetic 
Lorentz dielectric medium, an impulsive E(t) = joS(t) 
gives rise to an initial step change in polarization current 
dtP(t) ~ aeojio, an d a concomittant ramp/linear change 
in polarization initially, i.e. P(t) ~ aeatja. Fig. || shows 
some typical oscillating (under-damped) time responses 
to an impulsive driving force. 

Both cqns. (|ll|) and ( p"2"|) are linear, so that the model 
can also be expressed in terms of a response function 
u(t) - e.g. for the dielectric, we would have that P{t) = 
J Q u(t')E{t — t')dt'. We can then Fourier transform this, 
and when the transform of u(t) is denoted u(ui), we have 
that P(u) = u(ui)E(lu). Since the Fourier transform of 
eqn. (n2) is 



?7W 



]] P(u) = ae E(u>), 
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FIG. 4: A typical spectral response w(w) for the damped 
ocillator model. The solid line shows the real part of the 
response, the dashed line the imaginary part. The real part 
of the response at any point (e.g. A at cj ~ 0.6) depends on 
an integral of the imaginary part over all frequencies - i.e. 
over the whole dashed line, uj' £ [0, 00]. Similarly, likewise 
the imaginary part of the response at any point (e.g. B at 
uj ~ 1.4) depends on an integral of the real part over all 
frequencies - i.e. over the whole solid line, w' £ [0, 00]. 



the spectral response u(u>) is then easily obtained, being 

= 2 T° . (I 4 ) 

whose real and imaginary parts are shown on fig. ^. In 
an electromagnetic dielectric u(uS) is related to the re- 
fractive index n by n 2 = 1 + m(cj)/co @- We can see 
from cqn. ( |l4| ) that the real part of u(ui) has a frequency 
dependent variation with an explicit dependence on the 
loss parameter 7. Likewise, the loss-like part of the re- 
sponse, i.e. the imaginary part of u(ui), has an explicit 
dependence on frequency. 

Not all models that we might derive or write down will 
be causal. A contemporary example is the F-model [ p3[ , 
which is used to describe the magnetic response (i.e. the 
magnetization M) of the split-ring resonators (SRRs) 
often used in electromagnetic mctamaterials. While sim- 
ilar to the Lorentz model, its driving term is instead the 
second derivative of the magnetic H field. At frequen- 
cies relevant to typical applications, the F-model works 
well; but for high frequencies where uj 3> u)q, the associ- 
ated short wavelengths cause the approximations used to 
derive it to fail. The F-model follows 



d 2 t M{t) + 7$M(t) 



0J 2 M(t) 



The spectral response based on M(t) 
M{w) = u f (oj)H(uj) is 

u f (uj) = —5 



ad 2 H(t). (15) 
v,F(t) * H{t) or 



«W7 M 



(16) 



(13) 



This isn't KK causal, because eqn. (|l5| ) violates the 
criterion of having lower-order time derivatives on the 
right as compared to the left, and (therefore) eqn. dl6| ) 
fails the test imposed by the Kramers Kronig relations: 
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the response remains finite at arbitrarily high frequen- 
cies. More descriptively, we can say that ever higher 
frequency components of if and M become ever more 
synchronized; and synchronized quantities are related not 
by cause- and- effect, but with an equality. 

We can, however, fix the F-model by defining a new 
auxiliary field K = M — aH. This separates the changes 
in M caused by if, from those synchronized with if, 
by allowing us to merge the two highest-order derivative 
terms. The resulting explicitly causal F-modcl, where if 
and dt if cause changes in K, is now compatible with 
eqn. (0), and follows 



d\ K(f) + -yd t K(t) + u>%K(t) 



-ajd t H(t) - auj%H(§7) 



which has a spectral response uk based on K (t) = 
UK{t) * H(t) or K(u) = uk(ui)H(ui). It vanishes cor- 
rectly at high frequencies, and is 



(18) 



We can use either of eqn. (0) or @ to evaluate K, 
after which we can extract M and find B using M = 
K + aH and the constitutive relation /j.q 1 B = if + 
M = K + (1 + a) if. Although re-expressing the F- 
model in terms of K does not alter the many physical 
approximations made in its derivation, by shifting the 
non-causal part into the constitutive relation for B (an 
equality), it ensures that the dynamic response in eqn. 
( |l7| ) is now explicitly causal. 



V. MAXWELL'S EQUATIONS 

The curl Maxwell's equations control the behaviour of 
the electric and electric displacement fields E(r,t) and 
D(r,t), and the magnetic and magnetic induction fields 
B(r,t) and H(r,t); and depend on a current density 
J(r,t). Although usually written with the curl operator 
on the LHS ^2|, our simple causal model of eqn. ([]]) 
leads instead to 



d t D = V x if - J, 



dtB 



V x E. 



(19) 



These otherwise independent pairs E, B and D, if Q 
are connected together by the constitutive relations in- 
volving the dielectric polarization P and magnetization 
M of the background medium, which are 



D = e E + P, 



B = fj, Q H + fj, M, 



(20) 



and are subject to the constraint imposed by the diver- 
gence Maxwell's equations, which depend on the free elec- 
tric charge density p(r,t) and the zero magnetic charge 
density, and are 



p 



VB = 0. 



(21) 



Perhaps surprisingly, the causal nature of Maxwell's 
equations remains a subject of debate (see e.g. [EH [261). 



Nevertheless, the curl Maxwell's equations must be 
causal in the "step" KK sense: eqns. ( |l9| ) have the 
same form as our simple causal model in eqn. (Q). The 
curl operators, by specifying how the spatial profile of if 
and E drive changes in D and B, turns them into wave 
equations when combined with the constitutive relations. 
Take as a starting point the case in vacuum, where P 
and M are both zero, so that D = eoE and B = [j,qH. 
Then we can then rewrite eqns. ([l9]) solely in terms of 
any pair of electric-like (E or D) and magnetic-like (H 
or B) fields; e.g. 



e d t E = V x if - J, 



Vx£, 



(22) 



which can be combined into second order forms for E 
or B, e.g. eo^odfE = - V x V xE — fj, d t J. While 
these vacuum Maxwell's equations are self-evidently KK 
causal, more generally, the background medium for the 
electromagnetic fields can have non-trivial and dynami- 
cal responses to those fields encoded in P and Af. To 
avoid specifying particular response models, I represent 
all possible causal differential equations for P and M 
(compatible with eqn. (0)) with the notation 



StP = /(• 



StM = /(•). 



(23) 



As an example, 8 t P = f(E) might be used to represent 
eqn. (|l^), the Lorentz response in a dielectric. 

A straightforward expression of Maxwell's equations 
that emphasizes their causal nature is achieved by insist- 
ing that any given field should not appear on both the 
LHS and RHS of the equations. Thus every field that 
has a dynamical response (i.e. is modelled by a tempo- 
ral differential equation) can be updated simultaneously. 
There is no need to follow some specified sequence, al- 
though that can be useful, as in e.g. finite element simu- 
lations p7| . We can even do this even whilst incorporat- 
ing magneto-electric material responses, where the elec- 
tric field affects the magnetization, or the magnetic field 
affects the dielectric polarization. Maxwell's equations, 
written to fit these criteria, are 



d t D = +V x if - J, 6 t M = f(H, E) 



and d t B = -Vx£, 



6 t P=f(E,H) 



(24) 

0- 

(25) 



Although interdependent, these equations remain explic- 
itly KK causal in the sense that if and E are uniquely 
defined as causes, and D, P and B, M arc affected by 
those causes (i.e. show "effects"). Further, we cannot 
regard the displacement current dtD (or indeed its mag- 
netic counterpart dtB) as "causes" in the manner re- 
viewed by Heras these changes in D and B are 
instead effects. 

In typical non magneto-electric cases, where S t M = 
/(if) and S t P = f(E), the two equation sets (24) and 



(B5|) are independent of one another. This further possi- 
ble separation is the reason for associating the equation 
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for M with that for D in cqn. (p4|), and associating that 
for P with B in eqn. @. 

Once the RHS's of eqns. (^J) and (p^) have been eval- 
uated, the LHS's can be integrated directly in an explic- 
itly KK causal manner - the "cause" fields E, H have 
had effects on D,P and B, M. Then, the usual consti- 
tutive relations can be rearranged to connect the fields 
according to 



E = e " 1 [D - P] 



H = no B-M, 



(26) 



to allow us to directly update the the "cause" fields E 
and H . 

Note that if the evolution of M or P were (e.g.) to 
be written as dependent on D or £?, (so that 5tM = 
f(B,D)), then there is no longer a perfect separation 
between "cause fields" and "effect fields". Nevertheless, 
such a rewriting will not violate causality, since the dif- 
ferential equations still have the correct form. 

We can easily replace the abstract current density J by 
incorporating the motion of a particles of mass m,j and 
charge Qj at position Xj(t) with velocity Vj(t), by using 
additional causal equations 



d t Vj(t) 



Qj 



[E( Xj (t),t)+v 3 (t)xB(x 3 (t),t)} (27) 



d t Xj(t) = Vj(t), 



(28) 



along with the connection between the electric current 
density and the particle motion, and the charge density, 
which are 

J(r,t) = Y^qAr-x j (t))v j (t) (29) 

3 

p(r,t)=J2q j 6(r-x j (t)). (30) 

3 

Of course, how light changes its propagation proper- 
ties inside some material is not always described using 
charges, currents, or polarization and/or magnetization. 
Instead we often use its refractive index n(uj), which 
has two distinct contributions. First, there is the vac- 
uum or spacetime metric component, which is unity at 
all frequencies; it is this part that was historically inter- 
preted as an instantaneously responding electromagnetic 
"ether" . Second, there is that due to the polarization and 
magnetization responses (i.e. either n(uj) — 1 or n 2 (w) — I) 
fL6f ), and it is this second component which causality 
(and the KK relations) insist should vanish at high fre- 
quencies. 

VI. WAVES AND SIGNALS 

Causality can be broadened to include the notion of 
"signal causality" - where signals must pass between two 
separate points for influences to be felt. To analyse this 
problem, I take the postion that signals arc sent using 
waves, and that waves follow wave equations. This is 



motivated by the important case of lightspeed signals, as 
propagated according to Maxwell's equations. We will 
see that light propagation is not only "time-step" KK 
causal as discussed in section but "wave-cone" (light- 
cone) causal as well. 

To start, let us convert our simple causal differential 
equation (^) into a wave equation by augmenting the 
cause Q(x,t) by a spatial derivative of the effect R and 
a wave speed c, e.g. 



d t R(x, t) = ±cd x R(x, t) + Q(x, t), 



(31) 



where d x is just the spatial derivative d/dx. Thus we can 
see that waves are (at least in part) their own causes, 
where in particular the "cause" of changes in profile is 
their own pre-existing spatial modulation, in addition to 
any source-like causes Q. To obtain bi-directional prop- 
agation we need either higher order derivatives or multi- 
component waves, e.g. in three dimensions we need at 
least one scalar and one vector component. As for lin- 
earized acoustic waves in pressure and fluid velocity [^9| , 
we could link a scalar R and vector P with a pair of 
differential equations: 



d t R(r, t) = k- 1 V -P(r, t) + Qn{r, t), 



d t P(r,t) 



VR{r,t) + Q P (r,t). 



(32) 
(33) 



These can be combined into either of two second or- 
der forms, the scalar one being the simplest, which with 
Q(r,t) = d t QB,{r 7 t) + K- 1 S7 -Q P (r,t) and c 2 = 1/pn is 



d?R(r, t) = c 2 V 2 R(r, t) + Q{r, t). 



(34) 



This is explicitly KK causal, since the LHS has a second 
order time derivative, whilst the RHS has at most only a 
single time derivative inside Q. Many other wave mod- 
els are possible, notably the transverse electromagnetic 
waves generated by Maxwell's equations (see section ^) , 
for which a very general second order form comparable to 
cqn. (53) exists ]30( ] ; as well as a first order bidirectional 
versiori pif . The hyperbolic form of these second order 
wave equations gives rise to "wave-cones" analogous to 
the light cones discussed in relativistic scenarios. 

Most simple linear waves follow second order equa- 
tions like eqn. (34), although different physical prop- 
erties are denoted by R and Q in the different cases. 
Such waves provide a useful basis for analysing causal 
signalling, since they can be used to establish KK rela- 
tions that constrain spectra in both space and time. It is 
important to choose as a basis those waves travelling at 
the fastest relevant signal speed. Of course the speed of 
light can always fulfill this role, but there may be cases 
when a lower speed limit is appropriate. 

In special relativity, one important frame independent 
quantity for any two spacetime points (i.e. here the origin 
and r , t) is the interval a between them p3 . It also plays 
a crucial role in spacetime causality, since it is equally 
useful for waves in an isotropic material medium, as long 
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FIG. 5: The wave-cone coordinates a and /3 plotted on the 
usual t and r axes. For fixed a, j3 varies along hyperbolae, for 
fixed j3, a varies along (dashed) straight lines in the r, t plane. 

as we remain in the rest frame of that medium. The 
interval is 

a = \J c 2 t 2 — r ■ r = +V~a 2 ~. sgn(i). (35) 

Here I have used the sign of t to distinguish positive in- 
tervals (between now and points in the future wave-cone) 
and negative intervals (between now and points in the 
past wave-cone); imaginary values of a, resulting from 
c 2 t 2 < r ■ r, arc excluded. We also need a rapidity coor- 
dinate 

(3 = arctanh(|r|/ci), (36) 

which indicates the average speed v = \r\/t required to 
travel between a point r,t and the origin. How these 
relate to r, t coordinates is shown on fig. ^. The inverse 
transformation is 

ct = a cosh(/3), (37) 
\r\ = a sinh(/3). (38) 

In special relativity, where c is the speed of light, a and (3 
are related to the Rindlcr coordinates, but are for time- 
like intervals rather than spacelike ones. To complete 
the coordinate set, we need the two polar angles that en- 
able us to reconstruct the individual position coordinates 
r = (r Xl r y ,r z ) from their magnitude r = |r|; using 8 = 
tan -1 (r y /r x ) and <fi = cos~ 1 (r z / \r\). For notational con- 
venience, we can combine f3 with 8, </> to form a single ra- 
pidity vector (3 = (f3, 8, (f)); we might also define pseudo- 
Cartesian coordinates with (3 X = j3 cos 8 cos cf>, f3 y = 
(3 cos 8 sin (/>, and (3 Z = /3sm8 to give (3 = (f3 x , (3 y , (3 Z ). 

In the ID linear case with r = x and S(r, t) = R(x, t), 
or the spherically symmetric case with S(r,t) = rR(r,t), 
the wave equations in both sets of coordinates are 

d 2 S{r,t) = c 2 d 2 .S{r,t) + Q'(r,t), (39) 
a 2 d 2 a S(a, (3) = d 2 S(a 7 (3) + Q'(a, (3), (40) 

where the source terms are written simply as a Q', even 
though they vary between cases. The critical difference 



between the r, t coordinates and the a, (3 ones is that for 
the latter choice, the edge of the wave-cone is coincident 
with a = and is never crossed. Thus "spacelike" inter- 
vals are not covered by these hyperbolic coordinates. As 
a result, only the wave interval a has causal restrictions. 
And whilst eqn. ([39]) is explicitly time-step or KK causal, 
cqn. ( fief ) is something more - it is wave-cone causal (or, 
as named below, HKKK causal). 

When considering the wave equations ( |39| ) and (|4C|), 
note that the cigenfunctions of d 2 , d 2 , or d 2 are just 
plane waves (e.g.) exp(±iKf3) with eigenvalues —K 2 . In 
contrast those of a 2 d 2 are powers of a, where eigenfunc- 
tions a 1 / 2 a ±zI1 have eigenvalues — (n 2 + 1/4) . This means 
that while the Fourier transform is the natural choice for 
a spectral transformation on t, r, or (3, it is not so for a. 
In fact the power-law dependence of the eigenfunctions 
for a indicates that the Mellin transform [[[(J, [33| should 
be the natural choice. 

Nevertheless, the mathematical construction of the KK 
relations depends only on having a step function along 
some coordinate. Thus we can still use the Fourier trans- 
form to construct spacetime KK relations, and so avoid 
for now using the lesser known Mellin transform. Tak- 
ing the Fourier spectral counter part of a to be fi, we 
can rewrite eqn. (||) and get a KK relation consistent 
with the signal causality resulting from the wave equa- 
tion (|34|). Thus for a wave, or some wave-causal quantity 
S(r,t), we have the J7KK relations 

SRffl^fM,,, (41) 

Further, we can also consider the /3-spectral version 
of eqn. (|4l]); where we denote the wavevector-like coun- 
terpart of the rapidity /3 by the symbol K . Since j3 is 
unbounded, Fourier transforming S(£l, (3) into the K do- 
main gives the doubly transformed S(£l,K), satisfying 
51A'KK relations 

J-OO " - Si 

where K = (K,8,<fi). Of course, the angular coordi- 
nates 8, <p could also be transformed into angular spec- 
tra, or we could start with the pseudo-Cartesian (3 = 
(f3 x , f3 v , f3 z ) and transform into its spectral counterpart 
K= (K x ,K y ,K z ). 

Looking back at the definition of a, j3 we see that for 
pointlike objects, the standard KK relations might still 
be applied, since whatever variation in (3 there is merely 
compresses t differently with respect to a, leaving the 
step function intact. Nevertheless, discrepancies between 
true spacetime 17KK causality and simple temporal KK 
causality will exist; e.g. most simply, a pointlike object 
moving with fixed rapidity (3 will have its frequencies 
scaled by a factor of cosh f3 due to time dilation. 

More remarkable is what eqn. (^2|) says about the 
spatial (wavevector) spectra. Only for a narrow range 
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of fixed times in the past, where an object is well local- 
ized inside the wave-cone (so that Ar <C r and At -C t), 
can an ordinary spatial spectra S(k) based on an S(r) be 
made to match up with the (wave-cone) rapidity spec- 
tra S(K) based on S(j3). Nevertheless, in typical, non- 
rclativistic, laboratory situations with lightspeed com- 
munication, we can quite reasonably restrict our analysis 
to that small region of agreement. Otherwise, our lack 
of knowledge about the situation outside the wave-cone, 
and the mismatch between spatial coordinate r and the 
spatiotemporal /? means that since S(r) looks nothing 
like S(/3), their respective spectra S(k), S(K) are also 
utterly different: a structure periodic in r is not periodic 
in j3. 



VII. SIMULTANEITY, LOCALITY, AND 
SMOOTHNESS 

Two points raised by the stance of Jefimenko |2l| ques- 
tion (a) whether the use of local causality is appropriate, 
and (b) if causes and effects can be simultaneous. 
Local vs retarded causes: The local causality used here 
works knowing only the current state and current influ- 
ences, and makes only minimal assumptions. In contrast, 
some (e.g. Jefimenko [|6|, also see Heras prefer 
to relate effects back to their original causes. Thus the 
electromagnetic fields would be directly obtained from 
the integral equations over the past behaviour of charges 
and currents (see eqns. (7,8) in [^6|). From a practical 
perspective, this can raise difficulties: we often want to 
solve electromagnetic problems for free fields on the basis 
of some stated initial boundary conditions - where we do 
not know, nor want to calculate, whatever sources may 
have been required to generate them. KK causality, be- 
ing local, neither knows or cares about this "deep" past; 
but casuality is still enforced and remains testable. How- 
ever, this deep past is not irrelevant, and the assumption 
that fields can be related back to sources is an important 
one @. 

Simultaneity & smoothness: We might take the posi- 
tion that having any part (however infinitesimal) of the 
effect simultaneous with the cause is unsatisfactory; e.g. 
as does Jefimenko [|6| with regard to Maxwell's equa- 
tions, and by implication even the simple eqn. ([!]), even 
though both are KK causal (even if they might not be 
"Jefimenko causal" as well). Mathematically, the step 
response can be made non-simultaneous by use of t > to 
rather than t > to, but since this has negligible physical 
consequences, we might prefer to remove the simultaneity 
by demanding a smooth response. This can be instituted 
by replacing the LHS of eqn. ([[]) with the second order 
&l R, so that a delta function impulse gives a linear-ramp 
instead of a step response. Then we would no longer 
use dtx = v to say that "velocity causes change in posi- 
tion" , but instead use d\x = a and say that "acceleration 
causes change in position" . However, if we demand this, 
then we find that the curl Maxwell's equations - with 



their single time derivative - no longer count as causal; 
and other wave equations (e.g. (|3l|), (32), ([||)) suffer a 
similar fate. 



VIII. SUMMARY 

KK causal models are constructed as temporal differ- 
ential equations where the changes with time of the ef- 
fect depends on the strength of some cause. The simplest 
possible model is written down as eqn. ([l]), containing a 
cause Q, something to be affected R, and a single time 
derivative applied to R. In such a case an impulsive cause 
leads to a step-like effect. For example, in kinematics, the 
equation dtx = v, allows us to make the statement that 
"velocity v causes a change in position x v . In contrast, 
writing down e.g. F = ma does not allow me to claim ei- 
ther that "ma causes F" or "F/m causes a". But I could 
instead write dtv = F/m and then make the KK causal 
statement "a given F/m causes a change in velocity u". 
This argument then shows that Maxwell's equations are 
unambiguously causal in the KK sense, whether or not 
any useful alternative definitions of causality exist. 

It is well known that a spectral analysis of "time- 
step" causality gives rise to the Kramers Kronig rela- 
tions, which apply constraints to measured spectral data; 
and link the real and imaginary parts of the spectra in a 
global-to-local manner. Here I have broadened the notion 
of "time-step" KK causality and developed "wave-cone" 
S1KKK relations for signal causality in spacetime, a for- 
mulation compatible with not only Maxwell's equations, 
but most simple wave equations expressible in a second 
order form. 

Lastly, the definition of KK or fiKKK causality allows 
us to see the validity of Norton's argument that it 
is not a priori necessary to add causality as an extra 
assumption to physical models. Instead, we should take 
any given model and determine whether or not it is com- 
patible with causality by its construction - i.e. from its 
differential equation(s). 



IX. APPENDIX: A DISCRETE PICTURE 

We can also motivate the basic causal prescription of 
section |l| using a discretized argument of the type used 
to numerically solve differential equations. This is intro- 
duced by the diagram in fig. ^|. 

For the simple causal model dtR = Q defining an ef- 
fect on R due to the cause Q, the information needed 
to calculate the next value of Ri, i.e. R n +i, consists of 
the known Q n and the past (known) value of R n -i- For 
an alternative causal model &fR = Q, the information 
needed to calculate R n +i, consists of the known values 
Q n , R n and R n -i- Again, this demonstrates that we can 
calculate the future (i?„+i) using only known - current 
or past - information. 
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„ n-2 n—1 n n+1 n+2 n+3 n+4 

tij —m • • • o o o o o— 

• 1 o d,R = Q„ = (R„ +l - R„_,) 1 2A 

j R„ + ,= 2AQ, 

• f ° dfR = Q, = (R n+] - 2R„ + R„_, )/ A 2 

\ R„ +1 = A 2 Q n + 2R„ - R„_] 

FIG. 6: Here the filled circles denote known information, 
while unknown information is denoted by open circles. All of 
the causes Qi on the top line are specified by the model, and 
so are "known". Knowledge of Ri, indicated on the second 
line, has to be computed, and so at some index i = n we only 
know Ri up to R n , and not R n +i and later. The lower parts 
(blue, then green at the bottom) compare two simple causal 
models, as discussed in the text. 

It is always the highest order discrete derivative of R 
in the model which reaches furthest back and forwards in 
time when being calculated; and it is the most future-like 
component which we need to calculate, but only on the 
basis of known information. We can see on fig. [3] how 
to do the necessary rearrangements for the two simple 
models; but the general procedure is to discretize the en- 



tire differential equation around some central time-index 
i = n and then rearrange so that the most future-like 
term is on the LHS with the rest on the RHS. In the 
case of a third or fourth order derivatives centered on 
i = n, the most future term will be R n +2, which might 
seem problematic. However, the fix is simple - we just 
re-centre on n — 1 so that the most future term will in- 
stead become the desired Rn+i- Happily, we can apply 
this re-centering as required using even larger shifts, if 
still higher order derivatives are present. As a result we 
can always explicitly calculate the future (R n +i) using 
only known information from i < n. 

We might speculate that there could perhaps be a way 
to invert this scheme and instead denote d t R as the cause 
and Q the effect. However, even for the simple models, 
we see that to calculate Q n we need R n +i- Although all 
of Ri might indeed be pre-specified by our model, it is 
nevertheless true that when evaluating Q n we should not 
(yet) have access to R n +i - that value of "cause" has 
not been expressed to R yet. Thus we cannot consider R 
as the cause, because we insist that only current or past 
information known to what is being affected can influence 
it. 
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